A general method for sifting linguistic knowledge from structured terminologies

نویسندگان

  • Natalia Grabar
  • Pierre Zweigenbaum
چکیده

Morphological knowledge is useful for medical language processing, information retrieval and terminology or ontology development. We show how a large volume of morphological associations between words can be learnt from existing medical terminologies by taking advantage of the semantic relations already encoded between terms in these terminologies: synonymy, hierarchy and transversal relations. The method proposed relies on no a priori linguistic knowledge. Since it can work with different relations between terms, it can be applied to any structured terminology. Tested on SNOMED and ICD in French and English, it proves to identify fairly reliable morphological relations (precision > 90%) with a good coverage (over 88% compared to the UMLS lexical variant generation program). For English words with a stem longer than 3 characters, recall reaches 98.8% for inflection and 94.7% for derivation.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Presenting a method for extracting structured domain-dependent information from Farsi Web pages

Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...

متن کامل

Towards a Standardized Linguistic Annotation of the Textual Content of Labels in Knowledge Representation Systems

We propose applying standardized linguistic annotation to terms included in labels of knowledge representation schemes (taxonomies or ontologies), hypothesizing that this would help improving ontology-based semantic annotation of texts. We share the view that currently used methods for including lexical and terminological information in such hierarchical networks of concepts are not satisfactor...

متن کامل

Thesauri and formal classifications: terminologies for people and machines.

Terminologies are now software. They are key components of the integration of electronic patient records, decision support systems and information retrieval systems. To be used as software, the different types of content in traditional terminologies must be separated, which we term here: conceptual, linguistic, inferential and pragmatic. The conceptual knowledge at the heart of the terminology ...

متن کامل

Application of standardized biomedical terminologies in radiology reporting templates

The Radiological Society of North America (RSNA) has been promoting structured radiology reports by creating "best practices" reporting templates. The RSNA Reporting Template Library has been developed with the goal of integrating reusable knowledge into the clinical reporting process, which has intentionally incorporated standardized biomedical terminologies to reduce communication errors caus...

متن کامل

Exploring Reading Comprehension Needs of Yasouj EAP Students of Persian Literature

Abstract The main objective of the current English for Academic Purposes (EAP) programs in Iran is to fill the gap between the students’ general English competence and their ability to read discipline-specific texts. This study aims to investigate the target and present reading comprehension needs of EAP undergraduate students of Persian literature in Yasouj state university through a mixed met...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Proceedings. AMIA Symposium

دوره   شماره 

صفحات  -

تاریخ انتشار 2000